On Higher-Order Control Tasks: The Application of A3C on Space Fortress
Author
Abstract
This thesis studies the extent to which the A3C architecture can learn higher-order control tasks. The tasks are a set of subtasks of the simplified Space Fortress game that vary in complexity and in the order of control required. Previous experiments applying Deep Q-learning to these subtasks showed substantial learning on the lower-order tasks, but the higher-order tasks could not be mastered. Here, the GA3C architecture, a hybrid CPU/GPU implementation of A3C, was applied to the same set of subtasks used in the Deep Q-learning experiments. GA3C was able to master lower-order control tasks but could not master higher-order control tasks. Furthermore, GA3C did not show a significant improvement in learning on higher-order control tasks compared with the Deep Q-learning experiments.
Similar resources
Near-Minimum Time Optimal Control of Flexible Spacecraft during Slewing Maneuver
The rapid growth of space utilization requires extensive construction and maintenance of space structures and satellites in orbit. This will, in turn, substantiate the application of robotic systems in space. In this paper, a near-minimum-time optimal control law is developed for a rigid space platform with flexible links during an orientating maneuver with a large angle of rotation. The time op...
Space Fortress as an IQ Test? Predictions of Learning and of Practised Performance in a Complex Interactive Video-game
Claims that scores on pencil and paper IQ tests predict performance in easy laboratory perceptual motor tasks are weakened by methodological inadequacies. With an experimental design avoiding these weaknesses the AH 4 IQ test predicted rate of learning and performance after 5 days' practice on ‘Space Fortress’ better than did age, between 18 and 36 years, or amount of previous experience at vide...
Differential Flatness Method Based on Pre-set Guidance and Control Subsystem Design for a Surface to Surface Flying Vehicle (TECHNICAL NOTE)
The purpose of this paper is to design a guidance and control system and evaluate the performance of a sample surface-to-surface flying object based on preset guidance from a new perspective. The main idea of this study is to use a unique property of the governing differential equations to design and develop a controlled system. Thereupon a set of system output variables have bee...
GA3C: GPU-based A3C for Deep Reinforcement Learning
We introduce a hybrid CPU/GPU version of the Asynchronous Advantage Actor-Critic (A3C) algorithm, currently the state-of-the-art method in reinforcement learning for various gaming tasks. We analyze its computational traits and concentrate on aspects critical to leveraging the GPU’s computational power. We introduce a system of queues and a dynamic scheduling strategy, potentially helpful for ot...